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Background — The Charnley comorbidity classification organizes 
patients into 3 classes: (A) 1 hip involved, (B) 2 hips involved, and 
(C) other severe comorbidities. Although this simple classification 
is a known predictor of health-related quality of life (HRQoL) 
after total hip replacement (THR), interactions between Charnley 
class, sex, and age have not been investigated and there is uncer- 
tainty regarding whether A and B should be grouped together. 

Methods — We selected a nationwide cohort of patients from 
the Swedish Hip Arthroplasty Register operated with THR due 
to primary osteoarthritis between 2008 and 2010. For estimation 
of HRQoL, we used the generic health outcome questionnaire 
EQ-5D of the EuroQol group. This consists of 2 parts: the EQ-5D 
index and the EQ VAS estimates. We modeled the EQ-5D index 
and the EQ VAS against the self-administered Charnley classifica- 
tion. Confounding was controlled for using preoperative HRQoL 
values, pain, and previous contralateral hip surgery. 

Results — We found that women in class C had a poorer EQ-5D 
outcome than men. This effect was mostly due to the fact that 
women failed to improve in the mobility dimension; only 40% 
improved, while about 50% of men improved. Age did not inter- 
act with Charnley class. We also found that the classification per- 
formed best without splitting or aggregating classes. 

Interpretation — Our results suggests that the self-adminis- 
tered Charnley classification should be used in its full capacity 
and that it may be interesting to devote special attention to women 
in Charnley class C. 



The Charnley classification was introduced in 1972 as a 
comorbidity grouping for walking disabilities (Charnley 
1972). Today, it is a widely recognized predictor of patient- 



reported outcomes after hip and knee replacements (Callaghan 
et al. 1990, Garellick et al. 1998, Dunbar et al. 2004, Osten- 
dorf et al. 2004, Hulleberg et al. 2008, Rolfson et al. 2009, 

2011) , even though this has not been unanimously shown 
(McGuigan et al. 1995, Lavernia et al. 2009, Lawless et al. 

2012) . The classification has a simple design with 3 classes: 
(A) 1 hip involved, (B) 2 hips involved but no other joints, and 
(C) some other factor contributing to failure to achieve normal 
locomotion, such as rheumatoid arthritis, senility, hemiplegia, 
or cardiovascular or respiratory disability. Despite its simplic- 
ity, there have been concerns that the classification may suffer 
from interobserver variability (Bjorgul et al. 2010). For exam- 
ple, the proportion of class C reported by McGuigan et al. 
(1995) was around 4% while Lavernia et al. (2009) reported 
30%. To further complicate matters, some split the B class 
into 2 groups, those without an operated contralateral hip joint 
(Bl) and those with an operated contralateral hip joint (B2) 
(Dunbar et al. 2004, Roder et al. 2006). These inconsistencies 
can cause difficulties when clinicians try to use the classifica- 
tion system to inform their patients. 

Self-administered classification as used for the nationwide 
follow-up program ran by the Swedish Hip Arthroplasty Reg- 
ister (SHAR) may limit the interobserver variability. We have 
previously shown that Charnley class C is both common — 
45% in patients eligible for total hip replacement (THR) — and 
a strong predictor of poor patient-reported outcomes (Rolfson 
et al. 2011). Since class C makes up almost half of the study 
population, it is of interest to subdivide this group further and 
determine whether subgroups based on age or sex should be 
used for targeting interventions. Furthermore, class A and B 
are commonly grouped together, thereby possibly limiting the 
ability of the classification to discriminate between patients. 
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Investigation of the full capacity of the classification is there- 
fore of some interest. 

We investigated how the self -reported Charnley classifica- 
tion influenced health-related quality of life (HRQoL) 1 year 
after surgery. We hypothesized that each class in the self- 
reported Charnley classification has a different impact on 
HRQoL after THR and that each class may show an interac- 
tion effect with either age or sex. 



Methods 

Study design and data sources 

This was a nationwide prospective cohort study based on the 
Swedish Hip Arthroplasty Register (SHAR). Since its incep- 
tion in 1979, the SHAR has collected data on all primary 
THRs and any kind of further hip surgery associated with the 
primary intervention performed in Sweden (Karrholm 2010). 
A program for gathering patient-reported outcome measures 
(PROMs) was adopted in 2002 and reached full nationwide 
coverage in 2008 (Rolfson et al. 2011). A PROM question- 
naire is presented to all elective THR patients preoperatively 
and 1, 6, and 10 years after surgery. Each hospital is respon- 
sible for data collection and registration in an online database. 

Ethics 

The study was conducted in accordance with the Helsinki 
Declaration. It was approved by the regional ethics committee 
in Gothenburg, Sweden (Dnr 380-13). 

Participants 

All patients who had THR surgery between January 1, 2008 
and December 31, 2010 due to primary osteoarthritis and who 
were more than 40 years of age were eligible. Because imple- 
mentation of the program was gradual, we only included those 
who had returned the preoperative PROM questionnaire. If 
bilateral surgery was registered during the study period, first 
hip fulfilling previous criteria was selected. If the second sur- 
gery occurred prior to filling out the 1-year PROM question- 
naire, the patient was excluded. We also excluded patients 
with recorded reoperation or death within 1.5 years from the 
index operation. 

During this period, 38,596 total hip replacements were per- 
formed on 35,823 patients. Of these, 28,585 (74%) met the 
selection criteria. The 1-year follow-up response rate during 
this period was 92%. 

Outcomes 

The primary outcomes were the EQ-5D index and EQ VAS 
1 year after surgery. The EQ-5D form consists of 6 items: 5 
questions and the EQ VAS. The questions span 5 dimensions 
of health: (1) mobility, (2) self-care, (3) usual activities, (4) 
pain/discomfort, and (5) anxiety/depression. Each dimension 
has 3 levels of severity, generating a total of 243 combina- 



tions representing different health states. There are different 
value sets that may be used to translate these health states into 
a utility index. We used the Swedish experience-based time- 
trade-off (TTO) value set that translates the answers into a 
score between 0.34 and 0.97, on a scale where 0 represents 
death and 0.97 maximum attainable HRQoL by the EQ-5D 
measurement (Burstrom et al. 2013). The EQ VAS, in turn, 
consists of a visual analog scale (VAS) ranging from 0 to 100 
where the patients are asked to mark their HRQoL, and 100 
corresponds to full health. 

Variables 

The primary exposure variable was Charnley class. In the 
PROM program, it is assessed preoperatively and at 1 year by 
using 2 questions: (1) Do you have any symptoms from the 
other hip?; and (2) Do you have problems in walking for other 
reasons (e.g. pain from other joints, back pain, angina, or any 
other medical condition impairing your walking capacity)? 
We also included in the model preoperative HRQoL, previous 
contralateral THR, and preoperative pain. The pain was mea- 
sured using a VAS where 0 corresponds to no pain and 100 to 
maximum pain. 

Statistics 

We used linear regression to model the outcomes with the pre- 
operative outcome measure, previous contralateral THR, and 
preoperative pain VAS as possible confounders. Age, pain, and 
preoperative outcome measures were modeled using restricted 
cubic splines where the Bayesian Information Criterion (BIC) 
decided optimal degrees of freedom for the spline. Interactions 
were investigated by multiplicative terms between Charnley 
class, age, and sex in the regression model. The significance 
of these was evaluated by ANOVA (Harrell 2001). Non-signif- 
icant interactions were removed from the final model. 

We applied quantile regression to the final model using 3 
quantiles: 0.5 (the median), 0.25 (lower quartile), and 0.1 
(lower decile). Quantile regression is similar to linear regres- 
sion, with the difference that it uses quantiles instead of the 
mean and is thereby free from distributional assumptions 
(Koenker and Bassett 1978). Quantile regression was used 
to investigate how Charnley class influenced those who per- 
formed poorly. 

To understand what dimensions were affected by the Charn- 
ley class, we calculated the number of grades for each EQ-5D 
dimension that the patient changed before and after surgery; 
for example, if a patient had reported moderate problems 
before surgery and improved by having no problems after sur- 
gery, he/she had improved 1 grade, while a patient had dete- 
riorated 1 grade if he/she had started with no problems and 
then had moderate problems at follow-up. This change was 
analyzed using univariate linear regression. 

All analyses were performed using R version 3.0.2 (Team 
2013) and packages rms version 4.1-1 (Harrell 2013) for 
analysis, Gmisc version 0.6.2.1 for transition plots and table 
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Table 1. Basic characteristics of the study population, showing 
absolute numbers (%) for proportions and mean (SD) for continu- 
ous variables. The number of patients with missing observations 
is presented as percentages of the total for that particular group 





VariaHIp 

veil ictuic 


Wnmpn 

V v \J I 1 1 CI 1 


Men 




[I] — 1 \},OC-C-) 






70 (1 0) 


67 (1 0) 


Pain VAS mm 






Mpan (SD1 

1 V 1 0 CM 1 


64 (1 6) 


59 (16) 


MiQQinn 

i v 1 1 ooi i i y 


2 (0%) 


5 (0%) 


f.harnlpv pla<5Q 

i icu i iicy iiiaoo 






A 


7 14 1 ! f44%1 


6 604 (54%1 


B 


1 788 M 1%^ 

i , / UKj y i i /o j 


1 262 110%) 


c 


7 ^87 (45°/^ 

/ ,00/ >o/ 




i v 1 1 ooi i i y 


2 (0%) 


1 (0%) 


Prpvinn^ rnntralatpral THR 

1 1 C V 1 U U O bWI 1 LI dldLIZH Cll 1 1 II I 






No 


13,010 (80%) 


10,102 (82%) 


Yes 


3,312 (20%) 


2,161 (18%) 


HRQoL estimates 






EQ-5D index (preoperative) 






Mean (SD) 


0.72 (0.12) 


0.75 (0.11) 


Missing 


7 (0%) 


8 (0%) 


EQ-5D index (1 year) 






Mean (SD) 


0.87 (0.11) 


0.89 (0.10) 


Missing 


1 ,292 (8%) 


1,015 (8%) 


EQ VAS (preoperative) 






Mean (SD) 


52 (22) 


58 (22) 


Missing 


14 (0%) 


1 1 (0%) 


EQ VAS (1 year) 






Mean (SD) 


75 (21) 


78 (19) 


Missing 


1 ,294 (8%) 


1,014 (8%) 





output, ggplot2 version 0.9.3.1 (Wickham 2009) for graphics, 
and knitr version 1.5 (Xie 2013) for reproducible research. 
Due to non-normally distributed outcomes with heterosce- 
dasticity, the 95% confidence intervals (CIs) were calculated 



using a robust covariance matrix (HC3) through the sandwich 
package version 2.3-0 (Long and Ervin 2000, Zeileis 2004, 
Burstrom et al. 2006), except for quantile regression where 
bootstrapped confidence intervals were used (1,000 boot- 
straps). 



Results 

Complete data from 26,249 patients were available for analy- 
sis. In general, women were slightly older, experienced more 
pain, and categorized themselves more frequently in Charnley 
class C (Table 1). There were no major differences between 
those with full data and those with missing data. 

Main results 

Women in Charnley class C had a poorer outcome than men. 
This interaction was strongest for the EQ-5D index: the 
p- value for EQ-5D index was < 0.001 and that for EQ VAS 
was 0.008. These sex differences were confirmed in the quan- 
tile regression models (Figure 1). Age did not interact with 
Charnley class (the p-value for EQ-5D index was 0.57 and that 
for EQ VAS was 0.30). 

EuroQoL dimensions 

The sex interaction was mediated by the mobility dimen- 
sion where men in class C improved more frequently than 
women. 50% of the men improved at least 1 grade, as opposed 
to 40% of the women. The dimensions typically affected by 
the Charnley class were pain/discomfort and mobility. Pain/ 
discomfort did not differ between class B and C, while both 
mobility and usual activities decreased with higher Charnley 
class (Figure 2). 
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Figure 1 . A. An illustration of the difference in impact of Charnley class depending on sex. The results 
were adjusted to the median age and pain VAS: 69 years and 65 mm. The most frequent EQ-5D index 
value preoperatively was 0.87. The red area indicates values below the preoperative index value. The 
dashed line indicates the predictions from a linear regression model while the solid lines depict quantile 
regression predictions at the different quantiles. B. The same as in panel A but with EQ VAS as outcome. 
This was adjusted to the most frequent EQ VAS value preoperatively: 50 mm. 
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Figure 2. Average change before and after surgery, by EQ-5D dimen- 
sion (mobility, self-care, activities, pain/discomfort, and anxiety/depres- 
sion), each stratified by Charnley class. Each dimension may take on 3 
different ordered values; thus, the change ranges from -2 to 2 and this 
average is shown on the y-axis. The confidence interval bars indicate 
95% certainty of each average. The maximum influence of the Charn- 
ley classes was on the mobility and pain/discomfort dimensions. 
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Figure 3. The crossover between Charnley classes from before to after 
surgery. The patients before surgery are represented on the left while 
those after surgery are represented on the right. Pink corresponds to 
the proportion of women in each group while blue corresponds to the 
proportion of men. The size of the arrow is proportional to the percent- 
age of patients leaving that specific class and the color of the gradient 
corresponds to the sex proportion for each transition according to the 
color bar. 



Table 2. The changes between the preoperative and the postopera- 
tive Charnley class according to sex 



Grouping or subdivision of the classification 

Each Charnley class had a unique impact on the HRQoL 1 year 
after surgery. The model with Charnley class A and B grouped 
had poorer BIC estimates, indicating an inferior model. When 
we separated classes A and B, the contribution of the Charnley 
classification to the model improved by 12% for the EQ-5D 
index and by 1 1 % for the EQ VAS. 

When we analyzed the subdivision of class B, 3,156 patients 
were re-assigned to subclass B2 as they had (according to 
the SHAR) a previously operated contralateral hip. Of these 
patients, 2,749 were from class A and 407 were from class B. 
While this resulted in a sizeable change between classes, the 
regression models did not improve from application of this 
knowledge; both the EQ-5D index and EQ VAS had poorer 
BIC with the split B class. 

Self-reporting 

Women reported Charnley class C more frequently than men, 
both before (45% vs. 36%) and 1 year after surgery (47% vs. 
37%). We also found a considerable change in reported class 
before and after surgery (Figure 3): 39% changed class. Cross- 
overs that should be rare according to the original Charnley 
classification were fairly common; 18% changed either to 
class A from class B or C, or to class B from class C. The dif- 
ference between men and women regarding crossovers was 
small, although women deteriorated to a higher degree to class 
C one year later (Table 2). 

In the 1,284 patients who had been excluded due to con- 
tralateral surgery prior to reporting the one year follow-up 
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8% (542) 


8% (312) 


to C 


67% (4,495) 


58% (2,312) 



form, class A was the least common class (6%). Class B was 
approximately 4 times larger than in the study population 
(46%), while the proportion of class C was similar to that in 
the study population (47%). 

The 407 patients in class B who had previously undergone 
a contralateral THR performed worse than those who had not 
(EQ-5D: -0.025, CI: -0.037 to -0.013; and EQ VAS: -3.4, 
CI: -5.6 to -1.2). This negative effect was primarily due to 
less improvement in mobility, activities, and pain/discomfort 
dimensions, i.e. the EuroQol dimensions mostly affected by 
a THR. 



Discussion 

This is the largest study to be performed in which the Charn- 
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ley classification has been thoroughly analyzed. We found that 
Charnley class B is an independent risk factor and should not 
be grouped with class A. The patient-reported Charnley class 
B also appears to be better for assessment of HRQoL than 
subdividing patients with previous contralateral hip surgery 
into a separate class, as some authors have suggested (Dunbar 
et al. 2004, Roder et al. 2006). 

There was also a sex difference in the impact of Charnley 
class C on EQ-5D indices: women gained less than men. The 
mobility dimension in the EQ-5D appeared to be the main 
cause behind this interaction, where 25% more men than 
women improved 1 year after surgery. A possible explanation 
may be that Charnley class is closely correlated to postopera- 
tive ambulation (Roder et al. 2003) and that women have a 
different rehabilitation pattern (Vincent et al. 2006). It is also 
conceivable that women interpret class C differently, i.e. that 
the men in class C and women in class C are not comparable. 
As almost half of all women belonged to Charnley class C, 
we believe that this is an interesting sub-population to further 
study. 

Interestingly, sex differences were not seen for the EQ VAS 
outcome, possibly due to a different impact of mobility on 
HRQoL between the sexes. Furthermore, the Charnley class 
interaction difference between the EQ-5D and the EQ VAS 
illustrates how the 5 dimensions can capture different quali- 
ties than the EQ VAS, providing important input that helps the 
HRQoL interpretation. 

The Charnley classification is intended to categorize patients 
according to their walking ability and contralateral hip disease. 
Consistent with this intention, the mobility and pain/discom- 
fort dimensions in the EQ-5D index were the dimensions most 
affected by the self-categorized Charnley classification. Per- 
haps less consistent with the intention was the sizeable change 
between the preoperative Charnley class and the class 1 year 
later. It is worth noting that a considerable group crossed over 
to class A from both the B class and the C class, suggesting 
that classification based on a patient-administered question- 
naire is less rigid than originally intended (Charnley 1972). 
Conversely, patients did seem to classify themselves correctly, 
since those who had their contralateral hip operated soon after 
rarely reported class A. This is further supported by the lack 
of improvement when subdividing the B class according to 
previous hip arthroplasties. Interestingly, the few patients with 
a contralateral THR who reported class B performed worse 
even after adjusting for pain and preoperative HRQoL. This 
could be due to fear of surgery or to being genetically predis- 
posed to postoperative pain (Macrae 2008). We conclude that 
even though there is a difference from the original intention, 
patients appear to have a good understanding of their physi- 
cal health, thus possibly explaining why this self-categorized 
Charnley classification is such a strong predictor. 

To our knowledge, no other large cohort studies on Charn- 
ley classification have gone into the details of the classi- 
fication's impact on HRQoL. In order to keep our message 
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simple, we did not investigate other important metrics such as 
pain and satisfaction, which are also included by the SHAR 
PROM program. Generic measures such as the EQ-5D index 
are interesting metrics, as they combine many dimensions of 
health. While these measures are not as sensitive as disease- 
specific measures (Ostendorf et al. 2004), they do contain 
common osteoarthritis characteristics such as pain, activities 
of daily living, and mobility, while at the same time retain- 
ing the option of detecting unexpected correlations in other 
dimensions such as anxiety and depression. In large cohort 
studies, the size of the study group compensates for the lack of 
sensitivity while maintaining the ability to detect unexpected 
correlations. 

The estimated outcomes 1 year after surgery for different 
Charnley classes suggest that some patients will end up with 
values lower than their preoperative level. This estimation 
applies to the average-aged patient with the most common pre- 
operative health state according to EQ-5D/EQ VAS. Shifting 
the references to a lower preoperative EQ-5D index/EQ VAS 
or to younger patients would reduce the risk of not improving. 
We chose this presentation of combined estimates instead of 
single estimates to better illustrate different quantiles. 

The main strength of the present study was the large nation- 
wide cohort with a good response rate to the survey and small 
differences between respondents and non-respondents. We 
believe that the cohort selected was representative while mini- 
mizing confounders such as early reoperations, recall bias due 
to contralateral surgery, and misclassification of surgery indi- 
cation. Another strength was the use of splines to control for 
age confounding, thus minimizing the residual confounding. 
This is important, as Charnley classification is a comorbidity 
indicator and strongly correlates with age. By modeling age 
with a spline, and not in categories or as a straight line, con- 
founding will be negligible. 

We chose in this study to exclude those who died within 
1.5 years. We believe that HRQoL is interesting only if a 
patient survives and can benefit from the HRQoL gained. Fur- 
thermore, if a patient in Charnley class C is advised not to 
go through with surgery both due to risk of dying and poor 
HRQoL outcome, in a sense there will be a double penalty. 

Our main limitation was the scarcity of information per 
patient. During the study period, 2008 to 2010, the registry 
has started collecting interesting confounders such as BMI 
and ASA class. In future studies, these will most likely be 
included in the models. The Charnley classification can also 
be strongly influenced by simultaneous knee osteoarthritis and 
other lower extremity disabilities that we have not been able to 
adjust for (see Figure 4). 

Although major surgery such as THR would be expected 
to affect a patient's HRQoL, it is important to remember that 
other health-related events can occur between surgery and 
follow-up. With limited knowledge of this period, we want to 
stress the observational nature of our study and the need for 
further in-depth analysis. The scarcity applies also to the fact 
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Figure 4. Our directed acyclic graph (DAG) for this study as generated by dagitty.net. 
The green node indicates the exposure of interest, the green lines the exposures effect 
pathways, and the blue node with the "I" indicates the outcome of interest. The blue node 
without "I" is an intermediate to the effect pathway. White nodes are confounders that are 
adjusted for in the models, while red nodes indicate nodes that should be adjusted for 
but that are not available. 



that the follow-up was only 1 year after surgery. The SHAR 
collects PROMs at 6- and 10-year follow-up, but the data are 
still too few to analyze in-depth, although other studies have 
shown negligible change with longer follow-up (Judge et al. 
2013). 

We conclude that the self-administered Charnley classifica- 
tion is a reliable instrument, with interesting properties that are 
easy to use in everyday clinical practice. There is also strong 
evidence that women in Charnley class C fail to improve in 
mobility as much as men. 
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